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Abstract 

A new generation of instruments will reveal the microwave sky at high resolution. We focus on one of 
these, the Atacama Cosmology Telescope, which probes scales 1000 < I < 10000, where both primary and 
secondary anisotropies are important. Including lensing, thermal and kinetic Sunyaev-Zeldovich (SZ) effects, 
and extragalactic point sources, we simulate the telescope's observations of the CMB in three channels, then 
extract the power spectra of these components in a multifrequency analysis. We present results for various 
cases, differing in assumed knowledge of the contaminating point sources. We find that both radio and 
infrared point sources are important, but can be effectively eliminated from the power spectrum given three 
(or more) channels and a good understanding of their frequency dependence. However, improper treatment 
of the scatter in the point source frequency dependence relation may introduce a large systematic bias. 
Even if all thermal SZ and point source effects are eliminated, the kinetic SZ effect remains and corrupts 
measurements of the primordial slope and amplitude on small scales. We discuss the non-Gaussianity of 
the one-point probability distribution function as a way to constrain the kinetic SZ effect, and we develop a 
method for distinguishing this effect from the CMB in a window where they overlap. This method provides 
an independent constraint on the variance of the CMB in that window and is complementary to the power 
spectrum analysis. 



1 Introduction 

Several survey telescopes (SPT^, APEX^, ACT^) will 
open a poorly probed regime of the microwave back- 
ground: arcminute scales at frequencies up to a few 
hundred GHz. The sensitivities of these instruments 
will be a few micro-Kelvin. These efforts will provide 
a large amount of high-quality data, but face a dif- 
ferent set of challenges than lower resolution surveys, 
such as WMAP^. For lower resolution experiments 
the primary anisotropies of the CMB are the prin- 
cipal signal. The major contaminant is the galaxy, 
whose emission is diffuse and drops off rapidly away 
from the galactic center. By contrast, for higher reso- 
lution experiments, secondary anisotropies dominate 
at small scales. Assuming the observations are away 
from the galaxy at high frequencies, the major con- 
taminant is extragalactic point source emission. For 
a wide range of interesting scales, point sources con- 
tribute substantially more power than the instrument 
noise. 



^ http: / / astro.uchicago.edu/spt / 
^ http: / /bolo. berkeley. edu / apexsz / 
^http: / /www. hep. upenn.edu/act/act. html 
*http:/ /map. gsfc.nasa.gov/ 



The statistical properties of the primary CMB dif- 
fer from those of these signals. Primary anisotropies, 
from the surface of last scattering, dominate the 
CMB on degree scales. The fluctuations are Gaus- 
sian, so the power spectrum sufficiently describes the 
anisotropy. Secondary anisotropies, such as lensing 
and the Sunyaev-Zeldovich (SZ) effect, depend on the 
late-time, nonlinear evolution of the universe. Their 
imprints need not be Gaussian. 

In this work, we investigate the prospects of these 
experiments to extract the power spectrum and other 
statistical information for the components that con- 
tribute at these angular scales and frequencies. We 
are particularly interested in determining the primor- 
dial power spectrum from the primary CMB. Such a 
determination would enable one to determine the am- 
plitude and slope of the fluctuations on small scales 
and, in combination with large scale observations, 
would allow one to place powerful constraints on 
models of structure formation. We model our investi- 
gations on the Atacama Cosmology Telescope (ACT), 
a 6 meter off-axis telescope to be placed in the Ata- 
cama desert in the mountains of northern Chile J^. 
act's bolometer array will measure in three bands 
from 145-265 GHz, with beams sized 1-2 arcminutes. 
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(See TableUl) ACT will survey about 100 square de- 
grees of the sky with high signal to noise. In section|2 
we simulate observations based on the specifications 
of ACT. 

The first problem in extracting the primary CMB 
information from observations is separating the com- 
ponents on the sky with distinct frequency depen- 
dences. Although many signals at these scales are 
not Gaussian, the power spectrum is still the most 
important of the statistics, and in section|31we extract 
the power spectrum with a multifrequency analysis. 
The power spectrum analysis requires knowledge of 
the frequency dependence of the signals and of the 
scatter in that relation. Our knowledge is presently 
incomplete, and any error biases the estimated power 
spectra. Our discussion explores the impact of esti- 
mating some of this frequency information from the 
data itself. 

The second problem in extracting the primary 
CMB information is that one of the secondary 
anisotropics, the kinetic Sunyaev-Zeldovich (kSZ) ef- 
fect (and its analogs like the Ostriker-Vishniac ef- 
fect and patchy reionization effect), has the same fre- 
quency dependence as the primary CMB. This com- 
plicates their separation, requiring an explicit tem- 
plate for the kSZ power spectrum. Again our knowl- 
edge is incomplete, and any error biases the primary 
CMB power spectrum. We address in detail the im- 
pact of secondary anisotropics on the estimation of 
the amplitude and slope of primordial fluctuations. 

Non-Gaussian information can help distinguish be- 
tween the components which cannot be separated 
using frequency information. It is difficult to make 
non-Gaussian analysis fully exhaustive, so one must 
choose some simple statistics. Here we explore the 
one-point probability distribution function, in section 
21 This can provide a consistency check on a kSZ tem- 
plate and can assist in signal separation. Finally, in 
section El we present our conclusions. 

2 Simulations 

We simulate the sky as five astrophysical components 
(CMB, kSZ, SZ, and two types of point sources) 
plus noise. The CMB is a Gaussian random field. 
The density field (used for lensing of CMB), kSZ, 
and SZ are produced from hydrodynamical simula- 
tions Point sources are Poisson distributed from 
source count models. We add the appropriate de- 
tector noise and convolve with the telescope beam 
(assumed Gaussian). 

We briefly examine the simulations in general, be- 
fore addressing each signal in turn. The simulation 



contains forty flelds, each 1.19° x 1.19°, totaling ~ 57 
square degrees, somewhat smaller than the ACT sur- 
vey. The results will therefore be appropriate for a 
smaller, more conservative mock survey. Through- 
out, we have used the following cosmological param- 
eters in a flat, ACDM model: VL^ = 0.32, fi{, = 0.05, 
VLk = 0.63, 0-8 = 1-0, with = 70 km/s/Mpc. 

We express our fields in temperature units, or as 
a temperature perturbation 6(n) — AT(n)/TcMB 
for line of sight n. Sample temperature perturbation 
maps of each signal are shown in Figure ^ We will 
refer to each image as we discuss the signals in greater 
detail. Signals which have a blackbody frequency de- 
pendence (CMB, kSZ) have the same temperature 
perturbation at all observed frequencies. For all other 
signals, the temperature perturbation will depend on 
the observation frequency. 

In Fourier space, we use the fiat sky approximation, 

e(n) = ^ y d\ Oi exp(zn • 1). (1) 

The symbol 1 is a two dimensional wavevector. The 
power spectrum of a signal or channel is defined 
by (|0i|^) — Ci. For each signal, figure |21 displays 
the power spectra and cross-correlations between the 
bands. Below we include a more detailed look at these 
spectra. 

ACT will have channels near 145, 217, and 265 
GHz, which in simulations we take to have delta func- 
tion frequency responses. These bands are typical of 
ground based experiments because they lie in win- 
dows between atmospheric lines. The band at 217 
GHz also takes advantage of the thermal SZ "null," 
described below in section Figure |21 show a sim- 
ulated patch of sky observed by ACT in these three 
bands. We discuss this image further below. 

In the next section, we discuss the hydrodynam- 
ical simulations which were the source of several of 
our signals. Then we study the signals and noise in 
more detail. Section 12.71 list caveats to our simula- 
tions. Section 12.81 summarizes the discussion of the 
simulations. 

2.1 Hydrodynamical simulations 

Hydrodynamical simulations provide the density field 
for lensing, the SZ and kSZ effects. These simulations 
are described in more detail elsewhere E El El ■ A 
100 Mpc periodic box was simulated at 512^ reso- 
lution with a moving mesh. At several redshift slices, 
the contents are projected onto a plane parallel to 
one of the box faces. These projections are stacked 
with random horizontal and vertical offsets to form 
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Band (GHz) 


Beam FWHM (arcmin.) 


Sensitivity per pixel (/itK) 


145 


1.7 


2 


210 


1.1 


3.3 


265 


0.93 


4.7 



Table 1: Frequency channels, beam full-width at half maximum, and detector noise (thermodynamic tem- 
perature) of mock ACT survey. 



CMB 



kSZx 10 



maps which look back through a cone of the sky. The 
benefit to this method is that only one box must be 
simulated to create a much larger set of maps. In our 
case, we have forty maps from one simulation box. 
The obvious downside to this technique is that a sin- 
gle structure in the simulation may be seen several 
times, at different orientations and stages of devel- 
opment, in different fields. Thus the family of rare 
objects in the maps may not be truly representative. 

We use the same simulations to obtain the con- 
vergence maps for CMB lensing, the comptonization 
parameter maps for thermal SZ, and AT/T maps for 
kinetic SZ. These are discussed below for each effect. 



Rad. pt. src. x 10 



SZ 



2.2 Lensed CMB 




Figure 1: These simulated signals contribute to one 
of forty 1.2° x 1.2° fields. All fields are in tempera- 
ture units with a common gray scale, although kSZ 
and radio source signals have been multiplied by 10 
for visibility. SZ is shown at -Trj = 2y. The CMB, 
SZ, and kSZ are not convolved with any beam. For 
visibility, radio and infrared point sources are con- 
volved with the beam of the channel in which are 
most prominent, respectively a 1.7 arcminute (145 
GHz channel) and a 0.9 arcminute (265 GHz) full- 
width at half-maximum Gaussian beam. 



We employed CMBFAST |B] to generate the unlensed 
CMB power spectrum to Z = 20 000. This power spec- 
trum was used to create a Gaussian random field, 
which is the unlensed CMB map. Lensing bends 
light from one part of the sky to another, so that 
the light observed from line-of-sight n in reality came 
from n -|- ^n. The real space displacement vector 
5n may be calculated via the Fourier space relation, 
M(l) = 2ilK,(l)/P. The convergence map for CMB 
lensing K(n) is defined as a line-of-sight projection of 
the density perturbation S = p/p— 1: 



«(n) = 



2c2 



XLS r(x)r(xLS - x) ^in) 



r{x 



LS) 



dx- (2) 



The integral is over the comoving radial coordinate 
X- Here xls is the coordinate of the CMB's surface 
of last scattering at recombination, the line element 
radial function r(x) is simply x for a Aa-t universe, 
Q.m is the matter density parameter, n describes po- 
sition (n, x), and a is the expansion factor of the uni- 
verse. Software from [7j computes the lensed maps 
from unlensed maps and the convergence (See also 
[5j). Hydrodynamical simulations provided the con- 
vergence calculated from the gas density, 5g. Here 
we assume assume gas traces matter, which should 
be valid on scales of interest here. The primary 
CMB has Gaussian fiuctuations, but lensing intro- 
duces non-Gaussianities by correlating the CMB with 
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the nonlinear matter field. 

A sample lensed CMB map is shown in Figure 
n The lensing has negligible visible effect on the 
maps. The CMB power spectrum is shown in Figure 
121 Because we examine the temperature perturba- 
tion power spectrum, any signal with a blackbody 
frequency dependence will contribute equally in the 
three bands. This is the case for the CMB. Since the 
maps are only 1.2° on a side the power spectrum bins 
have a width Al « 300, so the acoustic peaks are not 
resolved. The CMB is the dominant signal on scales 
larger than I ^ 1000. The major features are the 
damping tail, where the power falls exponentially at 
1000 < I < 4000, and the contribution from lensing, 
which dominates the power at I > 4000. 

2.3 Sunyaev-Zeldovich effect 

The SZ (or thermal SZ) effect is a change in the spec- 
trum of the CMB due to scattering off of hot gas in 
the potential wells of large structures (see [3 for 
comprehensive reviews). Scattered CMB photons are 
preferentially raised in energy. This distorts the en- 
ergy spectrum: in the direction of the hot gas, the 
CMB has fewer low-energy photons and more high- 
energy photons. The number of scatterings for a pho- 
ton passing through a cluster is low, so the CMB 
photons do not achieve thermal equilibrium with the 
electrons. Therefore, the SZ effect does not follow 
a blackbody spectrum. If the electrons are not rela- 
tivistic then the frequency dependence is 



e^^(fi) - -2y{n) 



x/2 



tanh(a;/2) 



(3) 



where x = hv/ksTcMB and the comptonization pa- 
rameter y is given by the integral of the electron pres- 
sure along the line of sight: 



the effect. At high electron temperatures, the formula 
for the frequency dependence needs relativistic cor- 
rections. These corrections in particular displace the 
SZ null. However, this effect has a minor impact for 
the application here: even if the electron temperature 
was increased everywhere to Tg ~ 20 000 K (holding 
the y-parameter constant), the SZ power spectrum 
remains a minor addition to the 217 GHz band. 

A sample SZ map, showing negative Rayleigh- 
Jeans temperature, is depicted in Figure ^ The SZ 
effect is most striking in clusters. The resulting signal 
is non-Gaussian. 

The SZ effect has been studied extensively with 
simulations (e.g. H [TTl El 11 d HI) and semi- 
analytic methods ^1 El ■ There are some differences 
in the results between different simulations and halo 
model predictions (see for a discussion) , but these 
are most prominent on very small scales, while in the 
relevant range around I 3000 the agreement in the 
power spectrum is quite good. The SZ power spec- 
trum from simulations used here [21 is shown in Fig- 
ure (21 The effect of observation frequency is appar- 
ent. The SZ effect is most prominent, and negative, 
in the 145 GHz channel, absent at 217 GHz, and is 
substantial at 265 GHz. At very large scales, the dis- 
crete nature of clusters is apparent, and the power 
spectrum is like shot noise, with Cf^ constant. At 
smaller scales, the power goes as Cf ^ oc for scales 
of 2000 < I < 10000. At yet higher I, the power falls 
faster. 

2.4 Kinetic Sunyaev-Zeldovich effect 

The kinetic SZ effect is due to the scattering of pho- 
tons off gas with a radial peculiar velocity. It may 
be expressed as an integral over the density weighted 
gas velocity projected in the line of sight direction. 



f / ^fcsT'e(n) , (=)kSZ/-\ 



Here ctt is the Thomson scattering cross section, rig 
is the electron density, and Te is the electron temper- 
ature. This expression is derived in the single scat- 
tering, non-relativistic limit. In the low frequency 
(Rayleigh- Jeans) regime, x <^ 1 and 9^^ — —2y. 
At roughly 217 GHz, the SZ effect is absent. This 
is known as the SZ null. At frequencies lower than 
the null, the spectrum is underpopulated relative to 
a blackbody, and the effect is seen as a temperature 
decrement. At higher frequencies, the spectrum is 
overpopulated, and the effect is a temperature incre- 
ment. This very specific deviation from a thermal 
spectrum is SZ's signature, and is vital for extracting 



v(n) • n 

aTXe{n)ne{n) exp[T(x)]adx- 

c 

(5) 

Here Xe is the ionization fraction of the gas and 
t(x) is the optical depth to comoving radius x- As 
a Doppler shift, this fluctuation affects only the ra- 
diation temperature, so the spectrum of this effect 
is still blackbody. Photons scattering from electrons 
with radial velocity away from us show a tempera- 
ture decrement and vice versa. In our simulations, 
we assume a spatially homogeneous ionization frac- 
tion Xe- We discuss this assumption in section [TTI 
Figure ^ shows an example kSZ map. Note the as- 
sociation between the kSZ effect and SZ effect. The 
same gas causes both the kSZ and the SZ effect, but 
the cross-spectrum vanishes because, on average, half 
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Figure 2: Auto- and cross-power spectra from the ACT bands, in bins of width « 300. C; measures the 
power in temperature perturbation. Line type describes the frequency dependence of the signals. "145-145" 
denotes the power spectrum in the 145 GHz channeL "145-217" denotes the cross correlation between the 
145 GHz and 217 GHz channels. Others correlations are labeled similarly. Each signal is depicted in a 
separate panel. ACT measures the sum of all signals and noise, shown in the panel marked "combined." 
At each frequency in the combined signal, the CMB dominates at large scales. Small scales are dominated 
by experimental noise and infrared point sources. Effects with a Planck spectrum (CMB/kSZ) do not vary 
with frequency. The SZ 145 GHz-265 GHz cross correlation is negative, and is shown in absolute value. 
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of clusters have a peculiar velocity towards us, and 
half away from us. The equivalent of kSZ at higher 
redshifts is called the Ostriker-Vishniac effect. The 
second order term dominates over the linear effect, 
which is strongly suppressed. This effect is thus also 
expected to be non-Gaussian. Figure|21shows the kSZ 
power spectrum. Like the CMB, kSZ is equally rep- 
resented in all three channels. At higher I, we have 
roughly Cj^^z ^ ;-2 f^^, 4Q00 < i < 15000 [HI before 
falling off at high I (see j^l ^] for other simulations 
of the kSZ). Modeling the kSZ remains quite uncer- 
tain, ft is sensitive to the details of reionization which 
are not yet well understood and the power spectrum 
is probably uncertain to an order of magnitude |19j . 



by 

^CMB L"-' JTcmb "'0 ^ 

where the derivative of the Planck spectrum dB / dT is 
used to convert into temperature units, and dN/dS is 
the differential source counts. After masking of bright 
sources, described below in section 12.5.31 the radio 
point source power spectrum is plotted in Figure [51 
The radio power decreases with frequency. Since the 
sources are assumed to be spatially uncorrelated, the 
power spectrum is white: C['*'^ is constant. Radio 
point sources do not dominate in any of the bands at 
any scale, but proper treatment of them is important 
to get unbiased power spectrum results. 



2.5 Point sources 

Two populations of extragalactic point sources are 
the main source of astrophysical contamination for 
the survey. These are radio galaxies, brightest in 
the lowest frequency channel, and dusty galaxies, 
brightest in the highest frequency channel. Dusty 
infrared point sources are the more serious con- 
taminant. Point sources dominate the secondary 
anisotropics in the two higher frequency bands, and 
obey Poisson statistics (assuming no correlations). 
Thus the statistics are non-Gaussian. In the follow- 
ing, we discuss each family of sources, point source 
removal by masking, and the frequency coherence of 
the source population as a whole. 



2.5.1 Radio point sources 

Synchrotron emitting point sources feature promi- 
nently in the WMAP maps. They are less promi- 
nent here because of the frequency dependence, but 
still make a contribution. We model radio sources 
using a fit to the source count model of [201 based 
on |21) . For each flux level, we compute the number 
of sources per field according to a Poisson distribu- 
tion, where the average density is given by the source 
counts. We place the point sources without spatial 
correlation, as this is simplest, and any correlations, 
particularly for faint sources, are diminished by pro- 
jection. The flux of these radio sources decreases with 
increasing frequency, typically scaling as i/""'^ to 
For our simulations, we use a uniform random devi- 
ate for the exponent of the frequency dependence for 
each source, and compute the flux in each of the ACT 
bands, converting to temperature units. 

The power in uncorrelated point sources is given 



2.5.2 Infrared point sources 

Dusty galaxies reprocess starlight to emit strongly in 
the infrared. To model these sources, we use source 
counts at 353 GHz. These source counts are based 
on a model of the optical emission of galaxies com- 
bined with a model for interstellar dust re-emission 
in the far infrared of absorbed optical light 22 . This 
model's counts are within a factor of ~ 2 of the 
SCUBA counts at the same frequency. We place them 
in the fields in the same manner as the radio sources, 
with no spatial correlations. The flux of these sources 
depends on frequency roughly as z^"^'^, with some scat- 
ter. Thus infrared sources become more prominent 
at higher frequency. For our simulation, we chose 
frequency dependences of ly'^-v'^. 

After masking out the bright sources fsection r2.5.3|) 
the infrared point source power spectrum is plotted 
in Figure [2j The IR power increase with frequency. 
Again the power spectrum is white, since the sources 
are assumed to be spatially uncorrelated. Infrared 
point sources are more contaminating than the radio 
sources in the higher frequency bands, and are the 
major contaminant in these bands until the instru- 
ment noise dominates at high I. At 145 GHz, infrared 
sources are dimmer than the radio sources, but there 
are many more of them, so the power in each is about 
the same. The power in infrared sources increases 
with frequency. 

2.5.3 Source Masking 

To some extent, the brightest point sources can be re- 
moved individually by thresholding, frequency mod- 
eling, and masking. We explore this procedure. At 
what level may we reasonably (and conservatively) 
detect and remove point sources? What is the im- 
pact on a power spectrum analysis? Is information 
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from surveys outside the ACT survey helpful in con- 
straining the point source contribution? 

For simplicity in our simulations, we do not at- 
tempt to extract or mask point sources individually. 
Rather we impose a flux limit in our simulation, imag- 
ining that the most offending point sources have been 
already excluded. 

Our strategy for identifying point sources focuses 
on the 145 and 265 ACT bands, where radio and in- 
frared sources are most prominent, respectively. We 
spatially filter the maps to emphasize point sources 
and de-emphasize the CMB and detector noise. As a 
side effect, this also enhances SZ clusters in the maps. 
The filter we use is the reciprocal of the channel's 
power spectrum (Figure but any similar broad fil- 
ter would work. We set a flux thresholds, and identify 
sources. 

A +5(7 threshold at 145 GHz identified radio point 
sources down to 4 mJy. At 145 GHz SZ clusters are 
negative, so there is no chance to confuse SZ sources 
as point sources. (We ignore the possible correla- 
tion between sources and clusters, which may lead to 
the unintended elimination of some of the SZ clus- 
ters and thus reducing the SZ signal.) At 265 GHz, 
a +5a threshold identifies infrared point sources to 
5.5 mJy. This threshold is sufficiently high that only 
the very brightest SZ clusters could be misidentified. 
Source shape and frequency information should elim- 
inate any confusion. Thus we adopt these flux cuts 
in our simulations. 

As point sources are masked out, we reduce the 
survey area. The masked area can be calculated from 
the source counts. We assume the largest beam (1.7 
arcmin) as the rough scale of a mask. The fraction of 
the survey remaining unmasked after N sources are 
excised is roughly (1 — (mask) /47r /sky assuming 
masks can overlap. The radio and point source cuts 
described above will remove about 2 percent of the 
survey area. 

Might we make a more aggressive flux cut to re- 
duce the power in power sources, using multifre- 
quency information or a different point source detec- 
tion scheme? Yes, but to do so is not always useful. 
In a power spectrum analysis, a measured signal has 
variance like 

/^signal . ^ps . ^noise\2 

var(Cf8"^') oc ±}_L±h L. (7) 

/sky 

The purpose of masking sources is to reduce Cf^ , thus 
reducing the variance of our measured signal. Note, 
however, that masking also reduces /sky, which can 
serve to increase the variance if we mask too many 
sources. We shall see that for infrared sources, even 



simple methods for identifying point sources identi- 
fies so many sources that masking them all would be 
counterproductive. 

This can be understood visually in Figure 13 A 
simple multifrequency combination can readily iden- 
tify infrared point sources below the 5 mJy flux cut at 
265 GHz applied that image. For example, compare 
the 265 and 145 GHz channels and imagine masking 
every visible point source. The bulk of the survey 
would be consumed by the mask. 

As a more concrete example, lowering the flux cut 
on infrared sources from 5.5 mJy to 1.6 mJy at 265 
GHz removes ~ 35 percent of the survey area. Such 
a flux cut reduces the power in such sources by ^ 20 
percent. For a signal whose variance is dominated by 
faint point sources, this aggressive flux cut improves 
the variance only slightly. For a signal comparable in 
amplitude to the point source noise, the variance ac- 
tually worsens. At these flux levels the power in point 
sources is not dominated by a few bright sources, but 
by a mass of background sources. 

Thus it may be sufficient, or even advantageous, 
to use a more conservative point source removal. 
Whether or not to aggressively pursue point source 
removal depends on the particular application. Given 
a model for the signal and the source counts, one 
could in principle minimize the variance using expres- 
sions |H1 and [71 Since we are examining signals over 
a wide range in Z, and the relative power in point 
sources and other signals varies greatly, we will sim- 
ply use the more conservative cut for infrared sources. 

The differential counts for radio sources at the level 
of our flux cut are shallower, so masking is useful. 
It would also be helpful to compare the ACT sur- 
vey at 145 GHz to radio surveys covering the same 
area, to assist in identiflcation and masking of radio 
sources. This could reduce the variance caused by ra- 
dio sources, if such information were available. How- 
ever, we will proceed as if the fields from ACT are 
the only measurements of those sources available to 
us. Information from outside surveys to identify and 
mask offending radio point sources is helpful, but not 
necessary for a power spectrum analysis with ACT. 

2.5.4 Frequency Coherence 

For some purposes, it is useful to treat all sources as 
a single population. In this case, the details of the 
multiple source populations, frequency dependence, 
and scatter, may all be subsumed under a set of cor- 
relation coefficients which describe the frequency co- 
herence across bands. If the correlation coefficient for 
point sources between two bands is unity, then one 
band is a scaled version of the other. In our point 
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source model, the correlation coefficient between the 
217 and 265 GHz bands is 0.99, close to unity. The 
deviation from unity is dominated by scatter in the 
scaling of the IR sources. However, the point source 
population in the 145 band has a significant contri- 
bution from radio sources. We find the correlation 
coefficient between 145 and 217 GHz is 0.83. Between 
145 and 265 GHz the correlation is 0.76. Ignoring this 
frequency incoherence has serious consequences when 
estimating the power spectrum, as we note below. 

2.6 Detector noise 

We add Gaussian distributed detector noise to these 
simulated fields. In reality, the cosmic signals will be 
convolved with the beam of the telescope, and the 
detector will add white noise. Here we find it more 
convenient to take the mathematically equivalent ap- 
proach of de-convolving the beams from the three 
channels. Thus the cosmic signals are untouched and 
the noise is correlated, with noise power increasing at 
small scales. Assuming the noise and Gaussian beams 
in Tabled the power spectrum of noise is given by: 

Where a stands for the channel, aa is the RMS noise 
per beam-sized pixel, and the beam full width at half 
maximum ba must be converted to radians. Figure 
El shows the noise power spectrum in the three chan- 
nels. At large scales the noise is similar in the three 
channels. At smaller scales, the noise dominates in 
the 145 GHz channel first, then at 217 GHz, then 265 
GHz. 

2.7 Caveats and other signals not in- 
cluded 

Here we list several caveats and further possible im- 
provements that we did not include in our simula- 
tions. One caveat is that the simulations have cig = 1 
normalization, which is at the high end of what is cur- 
rently favored. In particular, this may make the SZ 
fiuctuations larger than in reality, since SZ is rapidly 
increasing with the matter power spectrum ampli- 
tude. 

The hydrodynamical simulations we incorporate 
into our mock survey assume a spatially uniform tran- 
sition at reionization. In reality, the ionized frac- 
tion will depend on where ionizing photons are pro- 
duced, their ability to propagate, and their efficiency 
to ionize. Since these quantities probably vary from 
place to place, reionization is not likely to be uniform. 



Modeling of this so-called patchy reionization is dif- 
ficult, and the impact remains somewhat uncertain. 
Thus we have not included it in this analysis. How- 
ever, its impact may be significant: in recent mod- 
eling by UJji. patchy reionization typically increased 
the power in the kSZ effect by an order of magnitude 
(see also 

The modeling of point sources is still quite uncer- 
tain. Source number counts, clustering, frequency 
dependences and possible correlations with SZ all re- 
quire improvements as better information becomes 
available. 

Additionally, the infrared point sources may be cor- 
related with CMB lensing, with this cross-correlation 
possibly detectable by Planck [23] using the 545 GHz 
band. We have not included this effect and do not 
consider it. The peak scale of the effect at I ~ 150, 
frequency scaling of point sources, and /sky of the 
experiments favor Planck, not ACT, to detect this 
effect. 

We have not included any galactic contaminants 
in these simulations. Because the survey covers a 
small fraction of the sky, we have assumed that a 
clean enough window could be found to look out of 
the galaxy with minimum contamination or that a 
slowly varying galactic component can be projected 
out of the analysis. 

Finally, the model of the instrument we include is 
simplistic. The actual survey will have a nontrivial 
geometry, non-uniform noise with a 1// component, 
and non-Gaussian beams. 

2.8 Summary of simulations 

We have constructed a 57 square degree mock CMB 
survey according to the specifications of ACT. It in- 
cludes five astrophysical components. The CMB is a 
Gaussian random field, lensed by a convergence map 
from hydrodynamical simulations. The same simula- 
tions produced the gas pressure (for SZ) and momen- 
tum (for kSZ). Both SZ and kSZ have well defined fre- 
quency dependences. SZ is negative in the 145 GHz 
channel, zero in the 217 GHz channel, and positive 
in the 265 GHz channel. Kinetic SZ scales with fre- 
quency exactly as the primary CMB. The simulations 
include two types of point sources. Radio sources dim 
with frequency and have their greatest contribution 
at 145 GHz and little contribution at higher frequen- 
cies. Infrared point sources brighten with frequency, 
and are prominent in the 217 GHz and 265 GHz 
bands. We have conservatively removed the bright- 
est radio and infrared sources. A more aggressive flux 
cut on radio sources can help a power spectrum anal- 
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Figure 3: A simulated 1.2° x 1.2° field in the bands of the ACT telescope. Pixel value indicates the difference 
in temperature from Tcmb • The mean flux increases with frequency because of the increasing background of 
galaxies shining in the far infrared. Small scale temperature decrements at 145 GHz indicate the SZ effect. 



ysis, if more information about the sources is avail- 
able from other surveys, but we assume it is not. A 
more aggressive flux cut on infrared sources begins to 
degrade the survey area, and depending on the appli- 
cation may not improve the variance in a power spec- 
trum analysis. We have to include infrared sources in 
any analysis because they are too numerous to excise. 
We include instrument noise and beam from Tabled 
The instrument has highest resolution at 265 GHz 
and lowest at 145 GHz. 

The assembled signals, convolved with the tele- 
scope beam and added to the noise, are depicted in 
the ACT channels in Figure |3| The striking features 
are the primary fluctuations on large scales, bright 
SZ clusters (as a decrement) at 145 GHz, and the 
substantial brightening of infrared point sources at 
higher frequencies. 

The power spectra of these channels are shown in 
Figure |21 The power spectrum in each of the three 
channels is dominated by the CMB on the larger 
scales and instrumental noise on the smaller scales. 
At the intermediate scales of this survey, SZ is the 
most significant component in the 145 GHz band. 
In the bands at 217 GHz and 265 GHz, infrared 
point sources are the major contribution to the point 
sources at intermediate scales. 

At this point we may draw several conclusions 
about these simulations. The CMB is sub-dominant 



on a wide range of scales accessible to this survey. 
The dominant cosmological features are secondary 
anisotropics, but these suffer from serious pollution 
from extragalactic point sources, particularly infrared 
sources in the high frequency bands. These astro- 
physical components exceed the noise over a wide 
range of interest. In section |21 we proceed with a 
power spectrum analysis and return to non-Gaussian 
features in^ 



3 Power spectrum analysis 

From our simulated maps, we seek to measure the 
power spectra of our various cosmic signals. We 
use a minimum-variance quadratic estimator, which 
can also be viewed as an iterative procedure to find 
maximum- likelihood solution |25j . The power spec- 
trum estimates are unbiased. For components that 
cannot be separated using the frequency information 
(formally in this case the Fisher matrix becomes sin- 
gular and the errors infinite) it is better to treat 
them as a single component and then try to sepa- 
rate them using additional considerations such as a 
prior knowledge of the power spectrum shape for the 
signals. For some applications, power-spectrum esti- 
mators that involve the intermediate Wiener filtered 
maps are minimum variance '26' . However for this 
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application, they are not necessarily minimum vari- 
ance. Our approach here is a generalization of those 
treatments and should improve upon those estima- 
tors. As an added bonus, the effects of imperfect 
cleaning of one component on the mean and errors of 
other components are automatically included, for the 
estimator is unbiased and minimum variance (con- 
sidering the imperfect cleaning, see For non- 
Gaussian signals, the estimator is unbiased, but not 
minimum variance. In this section, we discuss this 
estimator and its relation to the Wiener filter. We 
discuss the impact of scatter in the frequency de- 
pendence of signals. We measure the power spectra 
in several cases, making different assumptions about 
our knowledge of the point source frequency depen- 
dence. Finally, we discuss the importance of cleaning 
out kSZ in determining the amplitude and slope of 
primary fluctuations. 

3.1 Multifrequency estimator 

In this section we describe the power spectrum es- 
timator we employ. We will reference a particular 
Fourier mode in the flat sky approximation by its 
wavevector 1. We use I to denote the magnitude of 
this wavevector. We organize our data into a vector 
e = {ejj}. The subscript a refers to which band the 
data comes from. We will reserve Greek indices to 
refer to the channels of the detector, aPj. . . . We say 
that the data is given by the sum of the response of 
the instrument to the signals and the noise. Dropping 
the mode index, we write the data in channel a as , 
the signal as Sa , and the noise as . For a particular 
multipole, we characterize the data, signals, and noise 
by their covariances (eae^) = Ca/s, {saS*^) = Sap and 
{nan*p) = Naf}. Note that we could additionally as- 
sume the noise between channels is uncorrelated, but 
it is not necessary to do so. We assume that the 
signal and noise are uncorrelated, and that the signal 
depends on a set of parameters Z = Zp. Thus, restor- 
ing the multipole dependence, the covariance matrix 
is 

C = (eet) = Clp5,v (9) 



where 



Cq/3 — S^pi'Zi) -f N^p. 



(10) 



The covariance matrix C has both frequency channel 
and and multipole indices, and we have assumed that 
different multipoles are independent, hence the ^n'. 

In a Gaussian model, the likelihood function for 
the data is 

L(e) = (27r)-^/2det(C)-i/2exp(-ie^C-ie). (11) 



A second order expansion in Zp gives 

lni(Z + (5Z) = lnL(Z)+^ 

p 

1 ^ 92 ini(Z) 



ainL(Z) 



pp' 



dZ.,dZ.,, 



where 

ainL(Z) 



_a^lnL(Z) ^ etc-^C-^C-e 



5Zp5Zp,{l2) 



1 dC 



dZpdZp, 



dZp dZp. 



The ensemble average of the second expression is the 
Fisher matrix 



F, 



Note that for a given value of the trace accounts for 
the sum over {21 + l)/sky modes. At the maximum 
likelihood value the first derivative of the likelihood 
function vanishes, so we use the Newton- Raphson 
method to find the zero of the derivative. This leads 
to the minimum variance quadratic estimator for the 
parameters Z [231211 EE] ■ 



P' 



bp = tr[C-i — C-iN] 



(15) 



We use Zp to denote the estimate of Zp. 

In this formalism, the covariance matrix of the es- 
timated parameters, written Cpp', is given by the in- 
verse Fisher matrix: 



C 



VP' 



{ZpZpi) 



pp' 



(16) 



This technique requires knowledge of the covari- 
ance C. We may estimate this by taking a smoothed 
version of the data. If the covariance is incorrect, the 
estimator is still unbiased, but the error bars will be 
incorrect. 

The parameters we desire to estimate are the power 
spectra of the signals, defined in a way which does not 
depend on the frequency channels of our instrument. 
We denote our signal with s = sj,^. Here m indi- 
cates which signal we are referring to. We will also 
use the (cross) power spectrum of signals, denoted 
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^mn = ('sJn'Sn)^ir- We iiitroduce the frequency re- 
sponse 'R-afimn to relate the spectra of the signals in 
the channels to the spectra of the signals S'^„, how- 
ever we have chosen to define them. Note that S'^„ 
is symmetric in m and n, so only one ordering of each 
pair is required. Then the cross power spectrum from 
the combination of channel a and channel (3 will be 
given as: 

mn 

We discuss the frequency response in more detail be- 
low. To estimate the power spectrum of the signals 
we set Zp = Sl^^. Note that in this case the index 
p enumerates each combination of Imn. In this case 
the appropriate derivative of the covariance is given 

by 



dZ„ 



L/ YTi n 



(18) 



The final expression in the brackets is shorthand for: 
if (to — ml and n 



1 



ri ) or 
(m ~ n! and n = to'), 
otherwise. 



(19) 



If we know the shape of the cross spectra and wish 
to estimate the amplitude, we can write the signal 
power in the bands as 



ran 



(20) 



where Amn encodes the information about the ampli- 
tude, and S\^^ is the shape. If we want to estimate 
this amplitude, then Zp = Amn- Thus the derivative 
of the covariance is given by 



del, _ 
dZp OA 

r. 



del, 



i 

mn 



(21) 



If we know the shape of some signals, but not oth- 
ers, we may use the two expressions (fll^ and (PT|l in 
combination. In these expressions we have assumed 
that the power spectra of the components are un- 
known, but that their frequency dependence is known 
(and hidden in TZaf3mn coefficients). If the param- 
eters which determine the frequency scaling are not 
known then one can generalize this further, add these 
parameters to the estimate, and maximize the likeli- 
hood over them as well, provided the Fisher matrix 
does not become singular. The procedure of taking 
derivatives of the covariance matrix is similar, so we 
do not give such expressions here. 



3.2 Relation to Wiener filter 

If all signals have a spatially uniform frequency de- 
pendence, then sjj, = J2m Ramsln for all 1. The cor- 
relation coefficient of modes between frequency chan- 
nels will be unity (neglecting noise). In this case we 
can factor the frequency response into a simpler form: 



(22) 



We may put the components into a vector R„i = 
Ram- This allows us to compactly express the 
Wiener-filtered estimates of the signal maps, which 
minimize the variance in reconstruction errors. The 
same expression holds for all modes, so we drop the 
mode index 1. The Wiener estimate is 



(23) 
(24) 



The Sm are the Wiener filtered estimates of all sig- 
nal maps. From these estimated maps of the signals, 
we can compute the power spectrum, taking into ac- 
count the contamination from the other signals to the 
Wiener filtered maps. In the case of spatially uniform 
frequency dependence, this procedure is identical to 
the multifrequency estimate in the previous section. 
The Wiener estimate requires the power spectrum 
Sim, or we could use a power spectrum estimate and 
iterate. 

For signals with non-uniform frequency depen- 
dence, it may not be possible to write a vector 
which satisfies (|^ . A spatially- averaged frequency 
dependence may still be used to produce a Wiener- 
type estimate, but it is not guaranteed to be an unbi- 
ased or a minimum variance reproduction of the sig- 
nal. In addition, the power spectrum obtained from 
these maps can be substantially biased. This is be- 
cause a power spectrum estimate which uses a Wiener 
filtered map as an intermediate step does not account 
for the scatter in the frequency dependence. If this 
scatter is substantial, the Wiener filter may be in- 
appropriate. (See for making minimum variance 
maps of signals with good frequency coherence in the 
presence of incoherent foregrounds. To make a map 
of an incoherent signal, like a point source template, 
this approach may not be minimum variance. See 
27 for a generic model of frequency incoherence.) 

3.3 Frequency dependence and scat- 
ter 

To apply the multifrequency filter, one needs the 
noise power spectra and signal frequency depen- 
dences. We assume the noise power spectrum in the 
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three channels is well known. The frequency scalings 
of CMB and kSZ signals are constant in temperature 
units, so the relevant components of 'R-apmn are unity. 
For SZ, the frequency dependence is from equation|31 
The frequency dependence of point sources are poorly 
known. Significant modeling or outside observations 
are needed to fill in the remaining components of 
T^a0mn ■ Thcsc cauuot be completely determined from 
ACT alone in an unbiased way. If they are added to 
the parameter vector Zp without any additional as- 
sumptions, the Fisher matrix becomes singular. This 
means these parameters are completely degenerate 
with the power spectra: there exists a family of power 
spectra and frequency dependences which reproduces 
the covariance, and it is impossible to distinguish be- 
tween individual members of this family. 

It is possible, however, that for given models of the 
point source emission, the parameters of the model 
may be added to the estimate without forcing the 
Fisher matrix to be singular. Additionally, we can use 
the high I part of the spectrum, which is dominated 
by point source emission, to help constraint frequency 
dependence, at the expense of biasing our estimator. 
We use this in the following section. 

In our simulations, the frequency scaling differs 
substantially from point source to point source. The 
correlation coefficient between modes at different fre- 
quencies is less than unity, so the cross spectrum is 
less than predicted from the auto-powers. It may be 
senseless to produce a Wiener filtered signal estimate 
based on a spatially averaged frequency dependence. 
We do not use a spatially averaged frequency depen- 
dence in our power spectrum estimations, for we find 
it produces greater than 2a biases in the power mea- 
surement. The conclusion is that it is crucial to ad- 
dress the scatter in the frequency dependence of point 
sources. 

ACT and similar experiments will identify a popu- 
lation of bright sources, which may be used to study 
their frequency scaling. One should apply this in- 
formation to a power spectrum analysis with some 
caution, because the sources which contaminate the 
power spectrum analysis are dimmer than the popu- 
lation which is convenient to study. This may mean 
that contaminating sources are a different population, 
more distant, younger and less evolved, or some com- 
bination. Dust emission is complicated, and the emis- 
sion spectra of the point sources likely depends on 
the relative abundances of the dust species present. 
If the populations are different, it will be difficult to 
build an accurate model of the emission from these 
sources. In addition, bright sources may result from 
the confusion of two dimmer sources. Measurements 



of the scatter of source frequency dependence is then 
difhcult to interpret. 

High-resolution intcrferometric measurements 
would help in studying the source population. In 
particular, ALMA^ has frequency coverage in the 
ACT bands. It should have sufficient sensitivity 
to pick out sources which are confusing ACT, and 
sufficient angular resolution to resolve them. As 
mentioned, masking infrared sources to a low flux 
cut is not very promising, but ALMA would be 
useful as a tool to explore the frequency dependence 
and scatter of sources. 

3.4 Measured signal power spectrum 

We measure the power spectrum in several cases as- 
suming differing prior knowledge. Our list of cases is 
not exhaustive, but illustrates techniques one might 
explore. We always assume the frequency dependence 
of CMB, kSZ, and SZ. We enumerate our assump- 
tions, and the parameters to estimate in each case. 

The first two cases assume we have perfect knowl- 
edge of the point source frequency scaling and scatter. 

Case 1. We assume we know the frequency de- 
pendence of the point sources. The parameters 
to estimate are the power spectra, in bins, of 
CMB-|-kSZ, SZ, IR and radio point sources. 

Case 2. We assume we know the power spectrum 
shape and frequency dependence of the point 
sources. The parameters to estimate are the 
binned power spectra of CMB-|-kSZ and SZ, and 
2 amplitudes, for the IR and radio power spectra. 

The next two cases assume nothing about the point 
source frequency dependence. 

Case 3. We assume we know the shape of the point 
sources, and that the CMB+kSZ makes no con- 
tribution at I > 15000. The final condition en- 
sures a non-singular Fisher matrix. We treat the 
point sources as a single population. Parame- 
ters to estimate are the binned power spectra 
of CMB-|-kSZ and SZ, and 6 amplitudes, corre- 
sponding to the point source amplitude in the 
cross-correlations between the bands. This is 
equivalent to estimating the frequency depen- 
dence of the sources. 

Case 4- Like case 3, but without assuming the shape 
of the point source power spectrum. We treat 
point sources as a single population, assuming 
that the scatter in the frequency dependence is 

^http: / /www. alma. nrao.edu/index. html 
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constant as a function of scale, but we do not 
assume what it is. Parameters to estimate are 
the binned power spectra of CMB+kSZ and SZ, 
the 3 binned auto-power spectra of point sources 
in the three bands, and 3 correlation coefficients, 
which describe the scatter in the frequency rela- 
tion. 

The final two cases separate kSZ from the CMB using 
a template for the kSZ power spectrum. 

Case 5. Like case 2, including point source fre- 
quency and power spectrum shape information, 
but with two more assumptions. First, we as- 
sume the shape of kSZ power spectrum. Second, 
we assume the CMB makes negligible contribu- 
tions to the power for I > 6000. The second addi- 
tional assumption prevents a singular Fisher ma- 
trix. The parameters to estimate are the binned 
power spectra of CMB and SZ, and 3 amplitudes, 
for the kSZ, IR, and radio power spectra. 

Case 6. Like case 3, including only point source 
power spectrum shape information, but with 
the additional assumptions of case 5. Parame- 
ters to estimate are the binned power spectra of 
CMB-t-kSZ and SZ, 6 point source amplitudes, 
and the amplitude of the kSZ spectrum. 

If the parameters we choose to estimate can faithfully 
represent the real world, our estimate is unbiased. 
However, in any of our cases, an incorrect assumption 
can bias the estimate. Note that case 4 requires the 
least outside information. 

Where we use a frequency dependence, we use 265 
GHz as our reference frequency for SZ and infrared 
sources, and 145 GHz for radio sources. This choice 
is a convention, and has no bearing on the quality of 
the estimate. 

We estimate the covariance on the measured power 
spectra in two ways. First we use the inverse Fisher 
matrix, so the error on the estimate of power bin 
p is {FppY/"^. Second, we computed the covariance 
from the estimates on our 40 realizations. In Figure 
^ we show the errors on these power estimates as a 
function of L In the regime where instrumental noise 
dominates (at sufficiently high Z), the inverse Fisher 
matrix provides a good estimate of the errors. Else- 
where, we find the inverse Fisher matrix underesti- 
mates the error, due to the non-Gaussianities. Exper- 
iments of this type will likely need to rely on Monte 
Carlo simulations for their errors. 

We now discuss each case in turn. The assump- 
tion in case 1 and 2 of point source frequency knowl- 
edge is probably dangerous if we do not have such 



knowledge. A poor model will bias the power spec- 
trum in a way which may be hard to predict. On the 
other hand, if we do have a good understanding of the 
source population (from ALMA perhaps), cases 1 and 
2 represent ideal situations. For case 1, the measured 
power spectrum is shown in Figure [S] In this plot 
we use the errors evaluated over our independent re- 
alizations. Except for radio sources, we recover the 
power spectra well. This provides information about 
the shape of the power spectrum, which we have not 
assumed. Our result in case 1 is unbiased. 

We compared the inverse Fisher matrix and the co- 
variance matrix from the realizations. In case 1, the 
inverse Fisher matrix is diagonal in I. However, we 
find that the actual covariance matrix is not. The co- 
variance between the CMB-|-kSZ power bins is nearly 
diagonal in this case, but to Z of a few thousand, the 
SZ power bins are strongly correlated. 

For case 2, assuming the shape of the point 
source power spectrum improved the errors on the 
CMB-f-kSZ power spectrum by a factor of 4 relative 
to case 1. The errors on SZ are marginally improved. 
The power spectrum bins are somewhat correlated. 
In case 2, the error in the power spectrum amplitude 
in IR and radio point sources is AC;/C; — 5.0 x 10^^ 
and /S.C1/C1 = 0.20 respectively. Recall that this 
case combines all the information on the point source 
spectra into two amplitudes. The inverse Fisher ma- 
trix underestimates this error by a factor of 2. We do 
not plot case 2 because it is unbiased and similar to 
case 1, with smaller error bars, although the bins are 
more correlated. 

Case 3 does not depend on prior frequency infor- 
mation, but has its own problems. Case 3 depends on 
an incorrect assumption that only point sources con- 
tribute at high /, while kSZ continues to have some 
small contribution at these I. Thus the power spec- 
trum we deduce for point sources is slightly high, and 
this biases our measurement somewhat. This is rela- 
tively independent of the / cutoff. We plot the mea- 
sured power spectrum in Figure |S1 where the bias is 
apparent. The power spectrum bins are also some- 
what correlated. However, we needed no model for 
the point source frequency dependence. Our model 
is biased by as much as Icr at high /, but the bias is 
probably under better control than it would be for a 
model of frequency dependence, because at least we 
know in which direction our estimate is biased. 

Case 4 assumes less about the point sources than 
case 3, because it does not assume a shape for the 
power spectrum. We plot the power spectrum of 
CMB-l-kSZ and SZ in Figure □ This case does not 
successfully recover kSZ. There is a systematic bias 
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Figure 4: The errors on the estimated power spectrum. Lines indicate errors derived from the Fisher matrix: 
i.PppY^'^ ■ Symbols indicate the measured standard deviation from 40 reahzations. The errors in case 5 are 
similar to case 2, and the errors in case 6 are similar to case 3. The bottom panels show only case 1. For 
cases which assume the shape of the point source power spectrum, plotting the errors as a function of I is 
not possible. 
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Figure 5: Estimates of the power spectra in case 1, where we assume frequency information, but no 
information about the shape of any power spectrum. The sohd hne is the true value of the power spectrum. 
The X symbols show the estimated value of the power spectrum. The dashed lines show 1 standard deviation 
above and below the estimate, evaluated over our 40 realizations. This estimate is unbiased. The width of 
the bins is Al « 300. 
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Figure 6: Estimates of the power spectra in case 3, where we assume the shape of the point source power 
spectrum, but assume nothing about frequency dependence. 
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Figure 7: Estimates for case 4. Assuming no information about the power spectrum shape or frequency 
dependence of the point sources, we lose the abiUty to rehably detect kSZ. The estimate for SZ is biased, 
but the statistical errors are large. The bins are highly correlated. 



in the SZ, but statistical errors dominate. The bins 
are strongly correlated. Systematic biases dominate 
statistical errors in the recovery of the point source 
power spectrum. The relative bias is greatest at 145 
GHz, where the point source contribution is the least, 
and decreases at higher frequencies. Note that in case 
4 especially, the Fisher matrix characterizes the errors 
poorly. 

In cases 5 and 6, the kSZ is separated from the 
CMB. This is necessary to allow parameter estima- 
tion based on the CMB. Because the CMB and kSZ 
are degenerate in frequency, this requires a model for 
the kSZ power spectrum. The estimator in this case 
uses the high / portion of the kSZ spectrum, where 
CMB is not significant, to calibrate the model of kSZ. 
kSZ is then subtracted at low /. We use a 1-parametcr 
model for the kSZ power spectrum where the shape 
is known but the amplitude is not. The approach 
generalizes to a multiparameter model. We do not 
plot case 5. For case 6, we plot the recovered power 
spectrum (Figure |SJ). We do not detect a bias on the 
CMB in either case, although the SZ power spectrum 
is biased in case 6 as it was in case 3. The CMB power 
bins for / > 3000 are correlated by the kSZ template 
subtraction. SZ bins are also strongly correlated. 

We now summarize our power spectrum analysis 
results. Given a good understanding of the frequency 
dependence of point sources, we may successfully es- 
timate the power spectrum of each component, pro- 
vided we treat the CMB and kSZ as a single com- 
ponent. With no prior knowledge of the frequency 
dependence of point sources, we may make a some- 
what biased estimate of the power spectra if we know 
the shape of the point source power spectrum. Know- 



ing neither the frequency dependence nor the shape 
of sources, we detect SZ, but cannot positively iden- 
tify kSZ. For us to have confidence in our estimates, 
we must trust our knowledge of the point source fre- 
quency dependence. This prospect is unclear. If we 
do not have confidence in our frequency knowledge, 
we may still make an estimate of the power spec- 
trum, but it will be somewhat biased. In our more 
optimistic cases, if we want to estimate CMB without 
kSZ, we require a parameterized model for kSZ. The 
power spectrum template will likely come from simu- 
lation. However, any parameterized fit to the power 
spectrum of the signal components may introduce a 
bias. 

3.5 Measuring primordial amplitude 
and slope 

In our model, the kSZ power is already comparable 
to the size of the CMB error bars at I ^ 2000, so if 
we do not know the kSZ power spectrum at all then 
we should not use the CMB power spectrum beyond 
there. With a model for the kSZ power spectrum we 
may use bins with higher as in our cases 5 and 6. To 
illustrate this difference, we compute the covariance 
matrix for primordial amplitude A and slope n about 
the fiducial values A = 1 {COBE normalized) and 
n = 1. To convert from errors on binned power to 
errors on parameters we use 

_ sr^ dCi idCi 

where Cqq' is the covariance matrix for parameters 
q, Cw is the covariance matrix for the binned power, 
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Figure 8: Estimates of the power spectra in case 6, where we assume the shape of the kSZ and point source 
power spectra, but assume nothing about the point source frequency dependence. 



consisting of the covariances computed for CMB. For 
this we use the CMB block of the covariance matrix 
for case 6. (The resuh using case 5 is similar.) We 
compute the derivatives of the power spectrum us- 
ing CMB FAST. Then for primordial amplitude and 
slope, we have the following covariance matrix if we 
do not extract kSZ. 







A 


n 


C(/ < 2000) = < 




2 X 10"^ 


-4 X 10"^ 




n 




1 X 10-3 



(26) 

However, if we can extract the CMB, the covariance 
improves. 
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4 X 10"'^ 


-9 X 10-* 


(27) 
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Thus the covariance increases by roughly a factor of 
5 for A and n if the kSZ is not extracted. If inhomo- 
geneous reionization raises the kSZ power, shortening 
further the lever arm of the CMB, it becomes more 
important to extract kSZ. 

To gain this additional information from the CMB 
on scales I > 2000 we need a template for kSZ. This 
template will come from simulations, and may be cal- 
ibrated on the kSZ measured at higher I. However, 
there will be uncertainties in the template because of 
missing physics, such as patchy reionization. There- 
fore the power spectrum analysis should be checked 
in as many ways as possible against the data. One 
way we explore in the next section is using the non- 
Gaussian information. 



4 One-point distribution func- 
tion analysis 

CMB and kSZ cannot be distinguished in a multi- 
frequency analysis. We have seen this in our power 
spectrum analysis in the previous section, in which 
we could only estimate CMB separately from kSZ by 
knowing the shape of the kSZ power spectrum be- 
forehand. However, if the data do not represent a 
Gaussian random field, and our data do not, then the 
power spectrum does not extract all the information 
available. 

Statistics which highlight non-Gaussianity are also 
particularly interesting because in the standard pic- 
ture, primary CMB fluctuations are Gaussian. Any 
non-Gaussianities indicate deviations from the stan- 
dard picture for primary fluctuations, or in our 
case, indicate the presence of secondary sources of 
anisotropy. 

In principle then, it is possible to use non- 
Gaussianities to tease apart the CMB and kSZ. In the 
following, we show this is possible, and explore one 
method for doing so. The statistic we choose is the 
histogram of pixel temperatures, that is, the 1-point 
probability distribution function (pdf). In the con- 
text of weak lensing, the pdf provides stronger con- 
straints than simple statistics such as skewness [3T| . 
This makes sense because the pdf incorporates all the 
moments of the field evaluated at a single point, such 
as variance, kurtosis, and so on. For a Gaussian field 
the pdf is Gaussian. For a highly non-Gaussian field, 
such as kSZ, the pdf function may show strong tails. 
The information in the pdf is complementary to the 
information in the power spectrum. The pdf con- 
tains information about non-Gaussianities, but not 
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Figure 9: The spatial filter used for the non-Gaussian 
analysis and the components at 217 GHz. The verti- 
cal scale of the filter in this plot is arbitrary. 

about spatial correlations, because it is evaluated at 
a single point. The power spectrum contains infor- 
mation about spatial correlations, but not about non- 
Gaussianities. 

Below, we devise a method for examining the pdf 
of our maps to extract parameters of the component 
signals, given a set of templates. Then we apply this 
method. We want to separate CMB from kSZ, so 
we consider the sky at 217 GHz. This is simplest 
because it eliminates the signal from thermal SZ. We 
want to focus on scales where neither the CMB nor 
kSZ completely dominates the other. Thus, before 
evaluating the pdf, we spatially filter the maps. In 
Fourier space, we multiply the maps by a Gaussian 
centered at / « 3600 with standard deviation cr; w 
750. In figure |51 this filter is shown with the the 
power spectra of the various components of the sky 
at 217 GHz. 

In the next sections, we describe the pdfs of the 
major signals at 217 GHz in this window, describe 
our method for separating kSZ from CMB, and apply 
it to our simulated maps. 

We make no claim that this is an optimum method 
for separating CMB and kSZ. At this point, this 
method is intended as a demonstration of the power 
of using simple statistics which are sensitive to non- 
Gaussianities. 

4.1 Signal one-point distribution 
functions 

Here we examine the pdfs of the signals with the goal 
of identifying features that distinguish CMB and kSZ. 



In addition, the infrared sources will be an unavoid- 
able contaminant at 217 GHz. The radio sources are 
not very significant, and as we have mentioned, may 
be helped by additional masking. The noise after 
spatial filtering is very low. After spatial filtering, we 
show the histograms for the major signals at 217 GHz 
in Figure [TUl 

The pdf of the unlensed CMB is Gaussian. Af- 
ter lensing, the distribution is still Gaussian because 
lensing simply remaps locations on the sky. After spa- 
tial filtering, the lensed CMB need not have a Gaus- 
sian distribution function, but we find the deviations 
in this case to be insignificant. 

The distribution of kSZ should be symmetric, be- 
cause there should be roughly as many clusters mov- 
ing towards us as away. We expect the pdf of kSZ, 
or any signal sourced by discrete objects, to have a 
non-Gaussian point distribution function. This is be- 
cause the signal strength is highly localized: the val- 
ues of pixels are extreme in the direction of the ob- 
ject, and zero elsewhere. The kSZ histogram shows 
tails well beyond that expected from a Gaussian pdf 
of the same variance. A pdf with power law tails, 
such as Student's t-distribution, is a much better fit, 
although we find the i-distribution to be too poor of 
a model to apply our method. 

We might expect the point sources to have a non- 
Gaussian pdf, since they are positive definite, which 
would seem to argue for strong skewness. This is 
counteracted by two effects. For a high density of 
sources, several sources are included in each beam, 
and the non-Gaussianity is diminished due to central 
limit theorem considerations. In addition, our spa- 
tial filtering removes long wavelength modes, caus- 
ing ringing about each source which can wash out 
skewness. We show the histogram for infrared point 
sources, after spatial filtering, in figurcEB it is nearly 
Gaussian. A wider window function reduces ringing, 
and produces a point source pdf which is more posi- 
tively skewed. 

We see that the tails of the kSZ distribution are a 
unique feature with which we can try to distinguish 
it from the other signals. 

4.2 Likelihood 

Given a set of pixelized data, we want to examine 
the likelihood of a trial distribution function. Let 
Pa (AT) be a trial pdf, which depends on a set of 
parameters a. For now, think of it as an abstract pa- 
rameterized model for our data histogram. We con- 
struct our trial pdf in the next section. For a set of 
independent pixels AT^, the likelihood of parameters 
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Figure 10: The histograms of the major signals at 217 GHz, after spatial filtering. Each pair of plots shows 
the same data, except the vertical axis above has a linear scale and below has a log scale. The distributions 
have been normalized to unity. The CMB is well fit by a Gaussian. kSZ is poorly fit by a Gaussian with the 
same variance. Infrared point sources are slightly skewed in the positive direction, compared to a Gaussian. 
At right we show the histogram of the sum of these maps, along with our best fit pdf from fitting and 
convolving the template pdfs, as described in the text. 



a is given by 

L(a) = []Pa(Ar,), (28) 

i 

which is just the probability of the first pixel, times 
that of the second, and so on. Computationally it is 
more convenient to work with the logarithm of the 
likelihood. We also work with a binned pdf, so we 
combine all the pixels for each bin, and sum over 
bins: 

logL(a) = J2 logP,(AT,), (29) 

3 

where Nj is the number of pixels in histogram bin 
j, and ATj is the value at the center of bin j. This 
gives us the likelihood of parameters a for model pdf 
Pa (AT) in terms of the histogram of pixel values Nj. 

If the pixels are not all independent, this is not 
the correct likelihood function. The true likelihood 
would take into account the covariance between pixels 
(or alternatively, between bins). However, we find in 
practice that the peak of the given likelihood function 
in is a good enough estimate for this application. 
The correlation length of our maps after spatial filter- 
ing is much less than the size of the maps. Thus our 
maps contain a large number of uncorrelated patches. 
If these patches are thought of as larger, independent 



pixels, then the likelihood we have written has the 
right peak, but may have a wrong normalization and 
width. In our method, we use only the location of 
the likelihood peak, while the errors are assessed us- 
ing Monte Carlo simulations. 

Our goal is to find the set of parameters a which 
maximize the likelihood. We use Powell's direction 
set method Because the data (the histogram of 
pixel temperatures) is noisy, the likelihood function 
is plagued by local maxima. We repeat our max- 
imum likelihood search with randomized initial po- 
sitions and search directions in parameter space, to 
ensure that the maximum we find is reliable. 

We compute errors on our parameters by the boot- 
strap method. We sample our sky map realizations 
(with replacement) to generate synthetic data his- 
tograms. We find the maximum likelihood parame- 
ters for each of these synthetic histograms, and use 
the distribution of these to describe the probability 
distribution for the parameters. 

4.3 Separation by distribution func- 
tion 

To evaluated the likelihood function 1)29(1 . we need a 
parameterized model pdf for the data histogram at 
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217 GHz. To get unbiased estimates of the param- 
eters, our model must faithfully reproduce the data. 
We proceed by making models of each signal in turn. 
The pdf of the sum of independent random variables 
is given by the convolution of the pdfs of the indi- 
vidual random variables. Therefore, by modeling the 
pdfs of the individual signals, we can produce a trial 
pdf for the total data by convolution. We use binned 
functions for our models, and evaluate the convolu- 
tion with fast Fourier transforms. Our procedure is 
not very sensitive to the width of the bins unless the 
model pdfs contain sharp features. In this applica- 
tion, our pdfs arc smooth functions, and we find that 
roughly one thousand bins covering the pdf suffices. 

We make a 2 parameter model for the pdf. Our 
parameters are the quantities we seek to estimate, in 
this case the standard deviation of the CMB, which 
we label ctcmBi and the standard deviation of the 
kSZ, which we label Uksz- For the model pdf of the 
CMB we simply use a Gaussian with variance <Jq^^ . 
For the kSZ pdf, we use the actual histogram of the 
kSZ as a template. In the real experiment, this would 
not be available, but could be based on simulations. 
To generate a distribution with a given standard de- 
viation we rescale the template, preserving the dis- 
tribution's unit integral. For the model pdf of the 
point sources, we again use the actual point source 
histogram as a template. If we understand the point 
source population well enough to make a good power 
spectrum analysis, such that we wish to attempt a 
separation of CMB and kSZ, the we should be able 
to model the point source pdf. Note that our non- 
Gaussian analysis will provide a constraint on ctcmb 
and (Tksz individually. Contrast this to the power 
spectrum analysis, which constrains only the total 
variance cr^j^g -I- cr^sz' ^^Ibeit in a narrower window 
of multipoles. 

We apply our maximum likelihood parameter esti- 
mate to this model. The result is a > IOct detection 
of the CMB and a > 7a detection of the kSZ in our 
window (see Figure EJ. The estimate is somewhat 
degenerate in the direction which preserves the total 
variance ct^mb + <^ksz- 

The power of this method arises from the tem- 
plate for kSZ, which fixes the relation between the 
kSZ pdfs variance and tails. If this relationship is 
not fixed, then a narrow kSZ distribution with wide 
tails could mimic a wide kSZ distribution with nar- 
row tails, by adjusting the variance of the CMB pdf 
to compensate. Also, it is crucial to have the point 
source contribution constrained. Otherwise it would 
be degenerate with the CMB. 

We must depend on simulations for the relation- 
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Figure 11: Separating CMB from kSZ using the pdf. 
This plot shows the result of our fitting procedure 
for the standard deviation of the kSZ and CMB. The 
"-f" shows the best fit point. The "x" marks the 
true value. The dots show the distribution of the fits 
for 1000 bootstrap reahzations. The contour denotes 
the 68% region computed from the covariance of the 
bootstrap points (x^ with 2 degrees of freedom). The 
fine dotted line shows the circle of constant total vari- 
ance which passes through the true value. 

ship between the variance and tails of kSZ. This non- 
Gaussian method may also be viewed as a consistency 
check for kSZ template simulations. A simulation of 
kSZ must both satisfy the power spectrum at high /, 
and provide the correct non-Gaussianity where kSZ 
and CMB overlap. If the measurements differ, one 
should not have confidence in the template power 
spectrum for kSZ provided by the simulation. 

5 Conclusions 

We have developed simulations of the ACT experi- 
ment in three frequency bands and analyzed the abil- 
ity of ACT to extract the power spectrum and non- 
Gaussian statistics from it. The simulations include 
CMB, kSZ, SZ, and radio and infrared point sources. 
Secondary anisotropies dominate the CMB over a 
wide range of interest. These in turn are heavily pol- 
luted by point sources. Infrared point sources must 
be dealt with in the power spectrum, and cannot be 
completely dealt with by masking. While we included 
the latest constraints on the secondary anisotropies, 
significant uncertainties on the amplitude and corre- 
lation properties remain and could affect the conclu- 
sions reached in this paper. Particularly uncertain is 
the contribution of patchy reionization to kSZ. 
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An optimal multifrcqucncy filter can extract the 
power spectrum, but the results are subject to as- 
sumptions put into the analysis. We considered 
several cases, making different assumptions about 
our knowledge of contaminating extragalactic point 
sources. For optimal extraction knowing both the 
mean and scatter of the frequency dependence of 
point sources is crucial. If we do not use a prior model 
for the frequency dependence of the point sources we 
may still extract the power spectrum assuming spa- 
tial correlations are known, but the results may still 
be biased. Furthermore, using only frequency infor- 
mation in a power spectrum analysis, kSZ and CMB 
cannot be separated using data alone. This degrades 
the ability to measure the CMB at I > 2000 and to 
determine parameters from it. 

Wc also explored the ability to use the non- 
Gaussian information to constrain further the indi- 
vidual components. Non-Gaussianities in the his- 
togram of pixel temperatures at 217 GHz should 
be observable by ACT. Non-Gaussianities created 
by kSZ can allow one to distinguish it from Gaus- 
sian CMB on scales where the two cannot be sep- 
arated with the power spectrum analysis, assuming 
that simulations are able to provide an accurate tem- 
plate of kSZ. It is unclear how well a kSZ template 
from simulations will work when applied to, for exam- 
ple, kSZ created by patchy reionization. Therefore, 
our results are preliminary and more investigation of 
these techniques is needed. Combining the power 
spectrum template analysis with the non-Gaussian 
template analysis allows one to perform consistency 
checks among these methods. 

Our results suggest that extracting the primordial 
power spectrum information at high precision from 
small scale primary CMB will be challenging. Scat- 
ter in the frequency scaling of point sources, as well 
as their possible correlations, make the point source 
separation from CMB difficult. Assuming this is ac- 
complished, removing kSZ is even more difficult and 
can only be done with reliable templates for the power 
spectrum and for non-Gaussian signals. The final 
precision is likely to be limited by the modeling un- 
certainties and not by the statistical precision of the 
observations. It is hard to prognosticate the final ac- 
curacy given how uncertain the kSZ amplitude from 
patchy reionization is. In this context it is worth men- 
tioning that CMB polarization has fewer secondary 
anisotropics to worry about than the CMB tempera- 
ture anisotropy itself and so may ultimately be more 
promising as a tool to study the primordial power 
spectrum fluctuations on small scales. 
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